Échantillonnage adaptatif optimal dans les champs de Markov, application à l’échantillonnage d’une espèce adventice (Optimal adaptive sampling in Markov random fields, with application to sampling a weed species)

2012

This work has two parts: (i) a theoretical study of adaptive sampling in Markov random fields (MRFs) and (ii) the modeling of weed sampling in a crop field, together with the design of adaptive sampling strategies for that problem. For the first part, we modeled the search for an optimal sampling strategy as a finite-horizon Markov decision process (MDP). We then proposed a generic algorithm for computing an approximate solution to any finite-horizon MDP with a known model. This algorithm, called Least-Squared Dynamic Programming (LSDP), combines concepts from dynamic programming and reinforcement learning. It was then adapted to compute adapt…

[SDE] Environmental Sciences; dynamic programming; reinforcement learning; Markov random field; [SDV] Life Sciences [q-bio]; apprentissage par renforcement; batch; programmation dynamique; sampling cost; processus décisionnel de Markov; coût d'échantillonnage; Markov decision process; champ de Markov; adventice; weed; échantillonnage adaptatif